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PRODUCTION OF PROTEIWS 
USING HOMOLOGOUS RECOHBIHATION 



INTRODUCTION 

Technical Field 

The field of this invention is the 
expression of mammalian proteins. 

Background 

The discoveries of restriction enzymes, 
cloning, sequencing, reverse transcriptase, and 
monoclonal antibodies has resulted in extraordinary 
capabilities in isolating, identifying, and 
manipulating nucleic acid sequences* As a result of 
these capabilities, numerous genes and their 
transcriptional control elements have been identified 
and manipulated. The genes have been used for 
producing large amounts of a desired protein in 
heterologous hosts (bacterial and eulcaryotic host cell 
systems ) * 

In many cases, the process of obtaining 
coding sequences and eliciting their expression has 
been a long and arduous one. The identification of 
the coding sequence, either cDNA or genomic DNA, has 
frequently involved the construction of libraries, 
identification of fragments of the open reading frame, 
examining the flanking sequences, and the like. In 
mammalian genes where Introns are frequently 
encountered, in many instances, the coding region has 
been only small fraction of the total nucleic acid 
associated with the gene. In other cases, pseudogenes 
or multi-membered gene families have obscur d the 
ability to isolate a particular gene of interest. 
Nevertheless, as techniqu s have improved, there has 



^. 

xieeii a continuous parade of successful i.dent:. fixations 
and isolation of gsnes of interest- 

In many situations one is primarily 
interested in a source of the protein product. The 
5 cell type in the body which produces the protein is 

frequently an inadequate source, since the protein may 
be produced in low amounts, the protein may only be 
produced in a differentiated host cell which is only 
difficultly grown in culture, or the host cell, 

10 particularly a human cell, is not economic or 

efficient in a culture process for production of the 
product. There is, therefore, significant interest in 
developing alternative techniques for producing 
proteins of interest in culture with cells which 

15 provide for economic and efficient production of the 
desired protein and, when possible, appropriate 
processing of the protein product. 

Relevant Literature 

20 Mansour et al. , Nature , 336:348-352 (1988), 

describe a general strategy for targeting mutations to 
non-selectable genes. Weidle et al . , Gene , 66 t 193- 
203, (1988), describe amplification of tissue-type 
plasminogen activator with a DHFR gene and loss of 

25 amplification in the absence of selective pressure. 
Murnane and Yezzi, Somatic Cell and Molecular 
Genetics , 14:273-286, (1988), describe transformation 
of a human cell line with an integrated selectable 
gene marker lacking a transcriptional promoter, with 

30 tandem duplication and amplification of the gene 
marker. Thomas and Capecchi, Cell , 51:503-512, 
(19871, describe site-directed mutagenesis by gene 
targeting in mouse embryo-derived stem cells. Song 
et al. , Proc. Natl. Acad. Sci . USA , 84:6820-6824, 

35 (1987), describe homologous recombination in human 
cells by a two staged integration. Liskay et al., 
"Homologous Recombination Between Repeated Chromosomal 
Sequ nc s in Mouse Cells," Cold ;3pring Harbor, Symp. 



ffuant. Biol . 42*13-189, (1984);. describe Integra bio.'; 
of. twn different mutations o^. t.he same gene 
homologous recombination between the mutant genes. 
Rubnitz and Subramani, Mol. and Cell. Biol . 4:2253- 
5 2258, (1984), describe the minimum amount of homologi^ 
required for homologous recombination in mammalian 
cells. Kim and Smithies, Nucl. Acids. Res . 16:8887- 
8903, (1988), describe an assay for homologous 
recombination using the polymerase chain reaction. 

10 

SUMMARY OF THE INVENTION 
Expression of mammalian proteins of interest 
is achieved by employing homologous recombination for 
integration of an amplifiable gene and other 

15 regulatory sequences in proximity to a gene of 

interest without interruption of the production of a 
proper transcript. The region comprising the 
amplifiable gene and the gene of interest may be 
amplified, the genome fragmented and directly or 

20 indirectly transferred to an expression host for 

expression of the target protein. If not previously 
amplified, the target region is then amplified, and 
the cell population screened for cells producing the 
target protein. Cells which produce the target 

25 protein at high and stable levels are expanded and 
used for expression of the target protein. 

BRIEF DESCRIPTION OF THE DRAWINGS 
FIG. 1 is a diagrammatic illustration of the 
30 plasmid pCG.l showing the sequence of the modified 
polylinker; 

FIG. 2 is a diagrammatic illustration of the 
construction of the plasmid pCG.HRl; 

FIG. 3 is a diagrammatic illustration of the 
35 result of targeting the EPO locus by homologous 

recombination with the DNA from pCG.HRl cut with Not I; 
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PXG» 4 is ci diagrai-nmatic illust-rat xou of the PCH 
aitiplication fragment produced from i::ells in which a 
homologous recombination event has occurred. 



5 DESCRIPTION OF SPECIFIC EMBODIMENTS 

Methods and compositions are provided for 
production of mammalian proteins of interest in 
culture. The method employs homologous recombination 
in a host cell for integrating an amplifiable gene in 

10 the vicinity of a target gene, which target gene 
encodes the protein of interest. The region 
comprising both the amplifiable gene and target gene 
will be referred to as the amplifiable region. The 
resulting transformed primary cells may now be 

15 subjected to conditions which select for 

amplification, or the amplification may be performed 
subsequently. "Transform" includes transform, 
trans feet, transduce, conjugation, fusion, 
electroporation or any other technique for introducing 

20 DNA. into a viable cell. The chromosomes or DNA of the 
transformed cells are then used to transfer the 
amplifiable region into the genome of secondasry 
expression host cells, where the target region, if 
not previously amplified sufficiently or at all, is 

25 further amplified. The resulting cell lines are 
screened for production of the target protein and 
secondary cell lines selected for desired levels of 
production, which cells may be expanded and used for 
production of the desired protein in culture. 

30 The primary cell may be any mammalian cell 

of interest, particularly mammalian cells which do not 
grow readily in culture, more particularly primate 
cells, especially human cells, where the human cells 
may be normal cells, including embryonic or neoplastic 

35 cells, particularly normal cells. Various cell types 
may be employed as the primary cells, including 
fibroblasts, particularly diploid skin fibroblasts, 
lymphocytes, epithelial cells, neurons, endothelial 



cells, or other fiomatic ceils, or germ c*illt: , 01" 
paiticular interr-st ^ire skin fibroblasts, which can be 
readily propagated to provide for large numbers of 
normal cells, embryonic kidney cells, and the like. 
5 These cells may or may not be expressing the gene of 

interest. In those instances where the target gene is 
inducible or only expressed in certain differentiated 
cells, one may select cells in which the target gene 
is expressed, which may require immortalized cells 

10 capable of growth in culture. 

A number of amplifiable genes exist, where by 
appropriate use of a selection agent, a gene 
integrated in the genome will be amplified with 
adjacent flanking DNA. Amplifiable genes include 

15 dihydrofolate reductase, metallothionein-I and -II, 
preferably primate metal lothionein genes, adenosine 
deaminase, ornithine decarboxylase, etc. The 
amplifiable gene will have transcriptional signals 
which are functional in the secondary or expression 

20 host and desirably be functional in the primary host, 
particularly where amplification is employed in the 
primary host or the amplifiable gene is used as a 
marker * 

The target genes may be any gene of 
25 interest, there already having been a large number of 
proteins of interest identified and isolated with 
continual additions to the list. Proteins of 
interest include cytokines, such as interleukins 1-10; 
growth factors such as EGF, FGf , PDGF, and TGF; 
30 somatotropins; growth hormones; colony stimulating 
factors, such as and GM-CSF; erythropoietin; 

plasminogen activators, such as tissue and urine; 
enzymes, such as superoxide dismutase; interferons? 
T-cell receptors; surface membrane proteins; insulin; 
35 lipoproteins; ai-antitrypsin; CD proteins, such as 
CDS, 4, 8, 19; clotting factors, e.g.. Factor VIIIc 
and von Will brands factor; antic lotting factors, such 
as Protein C- atrial naturetic factor, tximor necrosis 
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factor? V.ransport proteins; howinq- receptors; 
addresRlns; regulatory prov-e.ins? etc. 

For homologous recombination, constructs 
will be prepared where the amplifiable gene will be 
5 flanked on one or both sides with DNA homologous with 
the DNA of the target region. The homologous DNA will 
generally be within 100 kb, usually 50 kb, preferably 
about 25 kb, of the transcribed region of the target 
gene, more preferably within 2 kb of the target gene. 
10 By gene is intended the coding region and those 
sequences required for transcription of a mature 
mRNA. The homologous DNA may include the 5 '-upstream 
region comprising any enhancer sequences, 
transcriptional initiation sequences, the region 5' of 
15 these sequences, or the like. The homologous region 
may include a portion of the coding region, where the 
coding region may be comprised only of an open reading 
frame or combination of exons and introns , The 
homologous region may comprise all or a portion of an 
20 intron, where all or a portion of one or more exons 
may also be present. Alternatively, the homologous 
region may comprise the 3 '-region, so as to comprise 
all or a portion of the transcription termination 
region, or the region 3' of this region. The 
25 homologous regions may extend over all or a portion of 
the target gene or be outside the target gene 
comprising all or a portion of the transcriptional 
regulatory regions and/or the structural gene. For 
the most part, the homologous sequence will be joined 
30 to the amplifiable gene, proximally or distally. 

Usually a sequence other than the wild-type sequence 
normally associated with the target gene will be used 
to separate the homologous sequence from the 
amplifiable gene on at least one side of the 
35 amplifiable gene. Some portion of the sequence may be 
the 5' or 3' sequence associated with the amplifiable 
gene, as a result of the manipulations 
associated with the amplifiable gene. 



a'liB noniologous rea.lons flanking the 
ampi.Af .v'^b.UJ* gene need not be .tdet-ttlcal to the. target 
region, where in vitro mutagenesis is desired. For 
example, one may wish to change the transcriptional 
initiation region for the target gene, so that a 
portion of the homologous region might comprise 
nucleotides different from the wild-type 5' region of 
the target gene. Alternatively, one could provide for 
insertion of a transcriptional initiation region 
different from the wild-type initiation region 
between the wild-type initiation region and the 
structural gene. Similarly, one might wish to 
introduce various mutations into the structural gene^ 
so that the homologous region would comprise 
mismatches, resulting in a change in the encoded 
protein. For example, a signal leader sequence would 
be introduced in proper reading frame with the target 
gene to provide for secretion of the target protein 
expression product. Alternatively, one might change 
the 3' region, e.g., untranslated region, 
polyadenylation site, etc. of the target gene. 
Therefore, by homologous recombination, one can 
provide for maintaining the integrity of the target 
gene, so as to express the wild- type protein under 
the transcriptional regulation of the wild-type 
promoter or one may provide for a change in 
transcriptional regulation, processing or sequence of 
the target gene. In some instances, one may wish to 
introduce an enhancer in relation to the 
transcriptional initiation region, which can be 
provided by, for example, integration of the 
amplifiable gene associated with the enhancer in a 
region upstream from the transcriptional initiation 
regulatory region or in an intron or even downstream 
from the target gene. 

In order to prepare the subject constructs, 
it will b necessary to know th sequenc which is 
targeted for homologous recombination. While it it*> 



reported -hat ^j: sequence of 14 barren corttplementarY tc 
a sequence in i g<?T)om^ may provide for hOTno logons 
recombination, normally the individual flanking 
sequences will be at least about 150 bp, and may be 
5 12 kb or more, usually not more than about 8 kb. The 
size of the flanking regions will be determined by the 
size of the known sequence, the number of sequences in 
the genome which may have homology to the site for 
integration, whether mutagenesis is involved and the 

10 extent of separation of the regions for mutagenesis, 
the particular site for integration, or the like. 

i The integrating constructs may be prepared 

in accordance with conventional ways, where sequences 
may be synthesized, isolated from natural sources, 

15 manipulated, cloned, ligated, subjected to in vitro 
mutagenesis, primer repair, or the like. At various 
stages, the joined sequences may be cloned, and 
analyzed by restriction analysis, sequencing, or the 
like. Usually the construct will be carried on a 

20 cloning vector comprising a replication system 

functional in a prokaryotic host, e.g., E. coli , and a 
marker for selection, e.g., biocide resistance, 
complementation to an auxotrophic host, etc. Other 
functional sequences may also be present, such as 

25 polylinkers, for ease of introduction and excision of 
the construct or portions thereof, or the like. A 
large number of cloning vectors are available such as 
pBR322, the pUC series, etc. 

Once the construct is prepared, it may then 

30 be used for homologous recombination in the primary 
cell target. Various techniques may be employed for 
integrating the construct into the genome of the 
primary cell without being joined to a replies t ton 
system functional in the primary host. See* fbr^ 

35 example, U.S. Patent No. 4,319,216, as well as the - ' 
references cited in the Relevant Literature section^ 
Alternatively, the construct may ba insert d into an 



appropriate vector, uuaalJ.y having a viral rep t.icatlon 
systeia, such as Sy40/ bcvine papilloma ^''ir^if:', 
adenovirus, or the like. The linear DNA sequence 
vector may also have a selectable marker tor 
identifying transf ected cells . Selectable markers 
include the neo gene, allowing for selection with 
G418, the herpes tk gene for selection with HAT 
medium, qpt gene with mycophenolic acid, 
complementation of an auxotrophic host, etc. 

The vector may or may not be capable of 
stable maintenance in the host. Where the vector is 
capable of stable maintenance, the cells will be 
screened for homologous integration of the vector into 
the genome of the host, where various techniques for 
curing the cells may be employed. Where the vector is 
not capable of stable maintenance, for example, where 
a temperature sensitive replication system is 
employed, one may change the temperature from the 
permissive ten^erature to the non-permissive 
temperature, so that the cells may be cured of the 
vector. In this case, only those cells having 
integration of the construct comprising the 
amplifiable gene and, when present, the selectable 
marker, will be able to survive selection. 

Where a selectable marker is present, one 
may select for the presence of the construct by means 
of the selectable marker. Where the selectable marker 
is not present, one may select for the presence of the 
construct by the amplifiable gene. -For the neo gene 
or the herpes tk gene, one could employ a medium for 
growth of the transforraants of about 0.1-1 g/ml of 
G418 or HAT medium respectively. Where DHFR is the 
amplifiable gene, the selective medium may include 
from about 0. 01-0. 25 /aM of methotrexate. 

In carrying out the homologous 
recombination, the DNA will be introduced into the 
primary cells . Techniques which may be used include 
calcium phosphate/DI-?. r-o ()recipitates , microinjection 



6 



10 

of DHA iiiUo the nucleus, electroporation, bacterial 
-prctoplaSw fusion with intact cells, trans fection, 
polycations, e.g., polybrene, polyornithine, etc., or 
the like. The DNA may be single or double stranded 
DNA, linear or circular. For various techniques for 
transforming mammalian cells, see Keovm et al . , 
Methods in Enzymology (1989), Keown et al . , Methods 
and Enzymolocrv (1990) Vol. 185, pp. 527-537 and 
Mansour et al. , Nature, 336:348-352, (1988). 

Upstream and/or downstream from the target 
region construct may be a gene which provides for 
Identification of whether a double crossover has 
occurred. For this purpose, the herpes simplex virus 
thymidine kinase gene may be employed since the 
presence of the thymidine kinase gene may be detected 
by the use of nucleoside analogs, such as acyclovir or 
gancyclovir, for their cytotoxic effects on cells that 
contain a functional HSV-tk gene. The absence of 
sensitivity to these nucleoside analogs indicates the 
aOjsence of the thymidine kinase and, therefore, where 
homologous recombination has occurred, that a double 
crossover event has also occurred. 

The presence of the marker gene as evidenced 
by resistance to a biocide or growth in a medium which 
selects for the presence of the marker gene, 
establishes the presence and integration of the target 
construct into the host genome. No further selection 
need be made at this time, since the selection will be 
made in the secondary expression host, where 
expression of the amplified target gene may be 
detected. If one wishes, one can determine whether 
homologous recombination has occurred by employing PGR 
and sequencing the resulting amplified DNA sequences. 
If desired, amplification may be performed at this 
time by stressing the primary cells with the - , 

appropriate amplifying reagent, so that multi-copies 
of the xarget gene are obtained. Alternatively, 
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ainplifxcatlon mf?y ev/ait transfer to tha secondary cell 
expression host 

High molecular weight DNA, greater than 
about 20kb, preferably greater than about 50kb DNA or 
5 preferably metaphase chromosomes are prepared froia the 
primary recipient cell strain having the appropriate 
integration of the amplification vector. Preparation 
and isolation techniques are described by Nelson and 
Housman, In Gene Transfer (ed. R. Kucherlapati) Plenum 

10 Press, 1986. The DNA may then be introduced in the 

same manner as described above into the secondary host 
expression cells, using the same or different 
techniques than employed for the primary cells. 
Various mammalian expression hosts are available and 

15 may be employed. These hosts includp CHO. cells, 

monkey kidney cells, C127 mouse fibroblasts, 3T3 mouse 
cells, Vero cells, etc. Desirably the hosts will have 
a negative background for the amplifiable gene or a 
gene which is substantially less responsive to the 

20 amplifying agent. 

The transformed cells are grown in selective 
medium containing eOxsut 0.01-0.5 f^li methotrexate and, 
where another marker is present, e.g., the neto gene, 
the medium may contain from about 0.1-1 mg/ml G418. 

25 The resistant colonies are isolated and may then be 
analyzed for the presence of the construct in 
juxtaposition to the target gene. This may be as a 
result of detection of expression of the target gene 
product, where there will normally be a negative 

30 background for the target gene product, use of PGR, 
Southern hybridization, or the like. 

The cells containing the construct are then ■ ■' ^ 
expanded and subjected to selection and amplification "^.i^^ ^ 
with media containing progressively higher 
35 concentrations of the amplifying reagent, for ' . 

example, 0.5-200 fiH of methotrexate for the DHFR gene, 
and may be analyzed at each selection step for 
production of the target product. Expansion will * 
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IncludG at Isaac dizplication and may result in a.. 
les^st 5 copies, preferably 10 copies or aiore ir* a 
tandem relationship. Thus protein production will be 
increased at least 1.5 fold from expression from a 
single copy, usually at least 3 fold/ preferably at 
least 5 fold. 

The various clones may then be screened for 
optimum stable production of the target product and 
these clones may then be expanded and used 
commercially for production in culture. In this 
manner, high yields of a product may be obtained, 
without the necessity of isolating the message and 
doing the various manipulations associated with 
genetic engineering or isolating the genomic gene, 
where very large genes can be a major research and 
development effort. 

The following examples are offered by way of 
illustration and not by way of lixnitation. 

EXPERIMENTAL 

Cells 

Normal human diploid skin fibroblasts, 
("primary recipient") are propagated in EEMEM medium 
supplemented with 20% fetal calf serum. Dihydro folate 
reductase (DHFR) deficient Chinese hamster ovary (CHO) 
DUKX-Bll cells (Urlaub and Chasin, Proc. Natl. Acad. 
Sci . USA 77:4216-4220 (1980)) ( "secondary recipient " ) 
are propagated in alpha -medium supplemented with 10% 
dialyzed fetal bovine serum. 

DNA Vector 

The amplification vector is constructed 
from PUC19 (Yanisch-Perron et al., Gene 33:103-119 
(1985)). A 1.8 kb Haell fragment containing a ; 
hygronrycin B phosphotransferase gene (hph) driven by 
the herpes simplex virus thymidine kinase (HSV tk) 
promoter is isolated from pHyg (Siigden et al. , Mol . 
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C5l2 . Bio-L 5s410-413 (19J:(5)} by digestion with HaelT 
and gel eloctrophoresis . Syptbetic adaptors are edded 
onto this fragment to convert the Hae ll ends into 
Hin di II ends and the resulting fragment is joined to 
5 pUC19 digested with Hin di II. The resulting plasmid 
pUCH contains the hygromycin cassette such that 
transcription of hph and beta-lactamase are in the 
opposite orientation. A 1.3 kb Sai l fragment 
containing a DHFR gene driven by SV40 transcriptional 

10 signals is isolated from pTOD (Connors et al., DNA 
7:651-661 (1988)) by digestion with Sai l and gel 
electrophoresis. This fragment is ligated to pUCH 
digested with Sai l . The resulting plasmid pUCD 
contains the DHFR cassette such that DHFR and are 

15 transcribed in the same direction. A 1.76 kb BamH I 

fragment from the phage F15 (Friezner Degen et al . , J . 
Biol. Chem . 261 :6972-6985 (1986)) which contains 1.45 
kb of DNA flanking the transcriptional start of human 
tissue plasminogen activator (t-PA) in addition to the 

20 first exon and part of the first intron is isolated by 
gel electrophoresis after BamH I digestion. This 
fragment is joined to pUCD following digestion of the 
latter with BamH I. The resulting plasmidLpUCG has 
the promoter of the t-PA fragment oriented opposite to 

25 that of the DHFR cassette. The t-PA fragment contains 
a single Nco l site, which is not unique to pUCG. A 
partial Nco l digest is carried out and a Not I linker 
is inserted. The resulting plasmid pCG contains a 
unique Not I site in the t-PA fragment which allows the 

30 plasmid to be linearized prior to transformation of 
the primary human diploid fibroblasts in order to 
increase the frequency of homologous recombination 
(Kucherlapati et al., Proc* Natl. Acad. Sci . USA 
81:3153-3157 (1984)). 

35 

Preparation of Primary Recipients 

^ The plasmid pCG linearized with Not I is ^ 

introduced into the primary recipients by 



electroporation employing DNA at. lOnM, The resulting 
cells ere then grown in selective medium (EEMEM with 
200 fig/ml hygromycin B) . Resistant colonies are 
isolated and analyzed, by PGR (Kim and Smithies , -- . 
Nucleic Acids Res . 16:8887-8903 (1988)) using as 
primers the sequences GCGGCCTCGGCCTCTGCATA and 
CATCTCCCCTCTGGAGTGGA to distinguish homologous 
integrants from random ones. Amplification of 
cellular DNA by PGR using these two primers yields a 
fragment of 1,9 kb only when DNA from correctly 
targeted cells is present. Cells comprising the DHFR 
gene integrated into the t-PA region are expanded and" 
used as a source of genetic material for preparation 
of secondary recipients . 

Preparation of Secondary Recipients 

Metaphase chromosomes are prepared Nelson et 
al., J. Mol. Appl. Genet . 2:563-577 (1984)) from 
recipients demonstrating homologous recombination with 
the DHFR and are then transformed in DHFR-def icient 
CHO cells by calcium phosphate mediated gene transfer 
(Nelson et al . , J. Mol. Appl. Genet . 2:563-577 
(1984))* The cells are then grown in selective medium 
(alpha-medium containing 200 fig/ml hygromycin B) . 
Resistant colonies are isolated and analyzed for 
expression of human t-PA (Kaufman et al . , Mol. Cell. 
Biol . 5:1750-1759 (1985)). The cell clones are then 
grown in selective medium containing progressively 
higher concentrations of methotrexate (.02-80 /*M, with 
steps of 4- fold increases in concentration) . After 
this amplification procedure, the cells are harvested 
and the human t-PA is analyzed employing an ELISA 
assay with a monoclonal antibody specific for t-PA 
(Weidle and Buckel, Gene 51:31-41 (1987)). Clones 
providing for high levels of expression of t-PA are 
stored for subsequent use. 



Isolation of a Genomic Clon e-: Ccncainiiuj 
Sequences for TarcTBting 
Erythropoietin 
A clone was obtained by screening a huinan 
5 placental DNA genomic library (Clontech) in EMBL 3- 
SP6/T7 using two 36 bp oligonucleotide probes 5'- 
CTGGGTTGCTGAGTTCCGCAAAGTAGCTGGGTCTGG-3 ' and 5'- 
CGGGGGTCGGGGCTGTTATCT6CATGT6TGCGTGCG-3 ' to the 
presumed promoter region of human erythropoietin. 
10 From this clone two subclones were created in pSP72 

(Krieg and Melton (1987) Meth. Enzymol. 155 , 397-415), 
one containing a 5 kb BamHI-Hindlll fragment from the 
region upstream to the coding region of EPO (pTD.l) 
and one containing a 5 kb Hindlll-BamHX fragment 
15 coding for EPO (pTD.2). 

Construction of DNA Fragment for 

TarqetincT Erythropoietin 
A plasmid pCG.l was constructed by 

20 replacement of the polylinker of pBluescript SK(-) 
(Stratagene) between the Sad and Rpnl sites with a 
synthetic double stranded 72 base pair DNA fragment 
(FIG. 1). Referring to FIG. 2, into pCG.l was cloned 
between the Hindlll and Xbal sites a 678 bp fragment 

25 containing the enhancer and promoter of the immediate 
early gene of human cytomegalovirus (CSSV, Boshart et 
al (1985) Cell 41, 521-530) obtained by a PCR 
amplification of the plasmid pUCH.CMV (gift of 
m:. CaloS/ Stanford U.) using the oligonucleotide 

30 primers 5'- 

CGCCAAGCTTGGCCATTGCATACGTT-3 ' and 5'- 

GAGGTCTAGACGGTTCACTAAACGAGCTCT-3' in order to engineer 
Hindlll and Xbal sites respectively onto the ends of 
the resultant fragment. The resultant plasmid pCG.CMV 
35 was used for further constructions. 

The 620 bp BstEII-Xbal fragment from the 
pTD.2 was joined by the use of a BstElI-Xbal adapter 
to pCG.CMV jT'^s^-rlcted wlt.a Mn- T 1:o create th plasmid 



pCC^CMV/EPO, in ^/hlch t.he BEtEII site of K^^O 
fragment is next to the promoter end of the CMV 
fragment. Into pCG.CMV/EPO was cloned successively a 
1.94 kb fragment encoding methotrexate resistance from 
5 the plasraid pSV2dhfr (Subramani et al (1981) Mol . 
Cell. Biol. 1, 854-864) and a 1.15 kb fragment 
encoding G418 resistance from the plasmid pMClneo 
polyA (Thomas and Capecchi (1987) Cell 51, 503-512). 
The neo gene was obtained as an Xhol-Sall fragment and 
10 the dhfr gene was obtained by PGR amplification using 
the primers 5 ' - 

GGACGCGTGGATCCAGACATGATAAGATA-3' and 5'- 
GGACGCGTCAGCTGTGGAATGTGTGTCAG-3' designed to add Mlul 
sites at the ends of the resultant fragment. The neo 

15 and dhfr genes were cloned into the Xhol and Mlul 

sites respectively of pCG.CMV/EPO to give the plasraids 
pCG.CMV/EPO/DHFR and pCG.CMV/EPO/Neo/DHFR such that 
their transcription is in the same orientation as that 
of CMV. Finally, the 5 kb BamHI-Hindlll fragment from 

20 pTD.l was added via Clal adapters at the Clal site of 
pCG.CMV/EPO/Neo/DHFR to give pCG.HRl. In pCG.HRl, the 
5' 5kb EPO fragment is in the same orientation as that 
of the 620 bp BstEII-Xbal fragment with respect to the 
original lambda clone. 

25 A 9.54 kb fragment containing the 5' 5)cb 

BamHI-Hindlll EPO fragment, the dhfr and G418 markers, 
the CMV enhancer/promoter and the 620 bp BstEII-Xbal 
EPO fragment can be released from pCG.HRl as a Not I or 
SacII fragment. This NotI fragment can be used for 

30 homologous recombination as, it is designed to serve as 
an omega structure in recombination having 5 kb and 
620 bp of homology to facilitate the event (FIG. 3). 

For electroporation, the DNA was first cut 
with NotI, then extracted with phenol /chloroform and 

35 precipitated by the addition of ethanol before 
centrifugation. The resultant DNA pellet was 
resuspended cit a concentration of 2 mg/ml in a volume 
(10 /il) of 10 iC'-'A Tris-HCl, 1 mM EDTA (TE) . 



Introduction of DMA i nto cells 
Transformed primary human 293 embryonal 
kidney cells (ATCC CRL 157 3) w^re cultured in Cellgro 
5 DMEM H16 (Mediatech) supplemented with 10% calf serum, 
glutaraine (2 mM) and penicillin 

(100 U/ml) /streptomycin (0.1 mg/ml) and grown at 37 °C 
in 5% COj. At 90% confluency, cells were prepared for 
electroporation by trypsinization/ concentration by 

10 brief centrifugation and resuspension in PBS at 10 

cells/0.8 ml. The cells were equilibrated at 4**C, and 
DNA (50 ^g) restricted with NotI (as described above) 
was added. The mixture was electroporated at 960 fiF 
and 260 V with a BioRad Gene Pulser and then iced 

15 again for 10 min before plating onto a 10 cm dish. 

After incubation at 37°C for 48 hr, the cells from a 
10 cm dish were split equally among 5 2 4 -well plates 
in media containig G418 at 0.6 mg/ml (effective 
concentration) . Under these electroporation 

20 conditions, 4-10 colonies/well survive drug selection 
after 2 weeks. 

Detection of Homologous Recombination by PGR Analysis 
Using NotI restricted DNA from pCG.HRl, 

25 successful homologous recombination is obtained by 

insertion of the 3.8 kb construct at the targeted EPO 
locus while simultaneously deleting 1.2 kb of genomic 
sequence (FIG. 3), PCR is used to detect unique 
targeting events versus random integration of the DNA 

30 as diagrammed in FIG. 4. Two primers are synthesized, 
one to the 3' end of CMV and the other to the region 
3' to the Xbal site used for the 620 bp BstEII-Xbal 
fragment in the targeting DNA, A homologous 
recombination event generates a DNA target in the 

35 genome from which these primers produce an 
aiii:plif ication product of 860 bp. 

In ord r to detect the targ ting event, 
y- clones (from the electroporated 293 cells) 
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from 4 wells each (representing about 16 colonies) 
were geueircited hy trypsinizing wells and using 90% of 
each well for the pool. The remaining 10% of each 
well was then reseeded back into the well. Genomic 
5 pNA was then prepared from each pool as follows. The 
cells in each pool were pelleted by centrif ugation for 
2 min. in a 1.5 ml microcentrifuge tube, resuspended 
in PBS (20 ^1), and treated for 1 hr at 37oc with a 
solution (400 /il) containing 10 mM Tris-HCl (pH7.5), 

10 100 mM NaCl, 5 mM EDTA, 1% SDS and RNase A (40 ftg/ml) . 
Proteinase K (10 /ilf 10 mg/ml) was then added, and the 
samples were incubated for 4 hr at 5 0^*0 before 
extractions by vigorous vortexing with 
phenol /chloroform (200 fil each), then with chloroform 

15 (400 /il), the addition of ethanol (800 fil) , and 

centrif ugation at 25°C for 10 min. The DNA pellets 
were washed with 70% ethanol, dried and resuspended in 
TE (20 ;*1). An average of 40 fig of genomic DHA was 
obtained from each sample. 

20 Approximately 1 /tg from each sample of 

genomic DNA was used for PCR analysis. The DNA in a 
volume (10 ;*1) of TE was boiled for 10 min. prior to 
the addition of PCR mix (40 fil) . The reaction (50 fiX) 
contained 10 mM Tris-HCl (pH 9.0 at 25*>C) , 50 mM KCl, 

25 1.5 mM MgClj/ 0.01% gelatin, 0.1% Triton X-100, 200 ^M 
dNTPS/ 1 /iM each of the primers 
5'-AAGCAGAGCTCGTTTAGTGAACCG-3' and 5'- 
TGAGCGTGAGTTCTGTGGAATGTG-3 ' , and 1 . 5 U of Tag DNA 
polymerase (Promega). Following an initial incubation 

30 of 940C for 3 min, the samples were subjected to 45 
cycles of denaturation at 94»C for 1 min., annealing 
at 66°C for 1.5 min. and extension at 72oc for 2 min. 
At the end of the 45 cycles, the samples were 
incubated an additional 5 min. at 72«*C. A por\.ivjn 

35 (20 ;*1) of each sample was analyzed on a 1% agarose 

gel run in TBE and stained with ethidium bromide. Out 
of the 90 pools analyzed from 3 electroporations , two 
samples were identified which exhibited the correct 
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size frac:rf:eiit 'C; Jslhidiiira bromide stalaluci. The DNA 
from the PCR reaction was recovered and subjected to 
restriction mapping with Xbal . The correct 
amplification product should upon treatment with Xbal 
yield two fragments, 669bp and 191bp. The samples 
from the two pools both yield fragments of the correct 
sizes. In addition, the sample from pool 1 exhibits 
other bands in the uncut material. 

Following the procedure described 
previously, metaphase chromosomes are prepared from 
the recipients demonstrating homologous recombination 
with DHFR and transformed in DHFR deficient CHO cells. 
After isolating resistant colonies and analyzing for. 
expression of EPO, the cell clones are grown in 
selective medium containing progressively higher 
concentrations of methotrexate (.02-80 /xH) with steps 
of 4- fold increases in concentration. The cells are 
then harvested, cloned and screened for production of 
EPO. Clones providing for at least 2-fold enhancement 
of EPO production are isolated. 

It is evident from the above results, that 
the subject method provides for a novel approach to 
expression of a wide variety of mammalian genes of 
interest. The method is simple, only requires the 
knowledge of a sequence of about 300 bp or more in the 
region of a target gene, and one may then use 
substantially conventional techniques for transferring 
the amplifiable region to an expression host, and 
production of the desired product in hiigh yield. 

All publications and patent applications 
cited in this specification are herein incorporated by 
reference as if each individual publication or patent 
application were specifically and individually 
indicated to be incorporated by reference. 

Although the foregoing invention has been 
described in some detail by way of illustration and 

xample for purpos s of clarity of understanding, 1: 
will b readily appar nt to those of ordinary rk.i ? ■ I 
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ths nrt ill light of the t'J5jichi.ngs of this :i-n\-3nclon 
that certain changes and modifications may hv rnndp 
thereto without departing from the spirit or scope of 
the appended olaims . 



WHAT IS CIxAI^ffiP IS : 

1. A. method for producing mammalian 
proteins comprising: 

growing mammalian secondary expression host 
cells comprising multiple copies of an amplifiable 
region comprising a target gene heterologous to said 
secondary expression host and expressing a protein of 
interest and an amplifiable gene, whereby said target 
gene is expressed and said protein is produced; 

wherein said secondary host expression cells 
are produced by the method comprising: 

transforming primary mammalian cells 
comprising said target gene with a construct 
comprising an amplifiable gene and at least one 
flanking region of a total of at least about 150 bp 
homologous with a ONA sequence at the locus of the 
coding region of said target gene to provide 
amplification of said target gene, wherein said 
amplifiable gene is at a site which does not 
interfere with the expression of said target gene, 
whereby said construct becomes homologously integrated 
into the genome of said primary cells to define an 
amplifiable region; 

selecting for primary cells comprising said 
construct by means of said amplifiable gene or other 
marker present in said construct; 

isolating DNA portions of said genome from 
said primary cells, wherein said portions are large 
enough to include all of said amplifiable region; 

transforming secondary expression host cells 
with said primary cell DNA portions and cloning said 
transformed secondary expression host cells to 
produce clones of said secondary expression host 
cells differing in said DNA portions present in said 
secondary expression host (.^ells: 
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i-jc\~ clones of ssu.ld j<»^i(ijaal iuu secondery 
expression hoert cells comprisino said atnpixf iable 
region; and 

amplifying said amplifiable region by means 
5 of an amplifying agent, wherein said amplifying is 
prior to said isolating or after said selecting and 
prior to said growing. 

2. A method according to Claim 1, wherein 
said amplifiable gene is a mammalian DHFR gene. 
10 3, A method according to Claim 1, wherein 

said portions are metaphase chromosomes. 

4. A method according to Claim 1, wherein 
said portions are restriction fragments. 

5. A method according to Claim 1, wherein 
15 said primary cells are human cells. 

6 . A method according to Claim 5 , wherein 
said human cells are fibroblast cells. 

7. A method according to Claim 1, wherein 
said construct comprises a biocidal marker providing 

20 resistance to a biocide for said primary host cells. 

8 . A method for producing mammalian 
proteins comprising: 

transforming mammalian prdlmary mammalian 
cells comprising said target gene with a construct 

25 comprising an amplifiable gene and at least one 

flanking region of at least about 150 bp homologous 
with a DNA sequence within 50 kb of the coding region 
of said target gene, wherein said amplifiable gene is 
at a site which does not interfere with the 

30 expression of said target gene, whereby said 

construct becomes homologous ly integrated into the 
genome of said primary cells to define an amplifiable 
region comprising said amplifiable gene and said 
target gene in said genome; 

35 selecting for primary cells comprising said 

construct by means of said amplifiable gene or other 
marker present l i said construct; 



isolaL.\..iy c:::/. portions of sai:^. geno^ne .-nroin 
said primary cells, wherein said portior.s are large 
enough to include all of said amplifiable region; 

transforming mairaaalian secondary expression 
5 host cells with said primary cell DNA portions, 

wherein said secondary expression host cells are of a 
different species from said primary host cells, and 
cloning said transformed secondary expression host 
cells to produce clones of said secondary expression 
10 host cells differing in said DNA portions present in 
said secondary expression host cells; 

selecting clones of said mammalian secondary 
expression host cells comprising said amplifiable 
region; 

15 amplifying said amplifiable region by means 

of an amplifying agent, wherein said amplifying is 
prior to said isolating or after said selecting; and 

groing said secondary expression host cells 
comprising multiple copies of said amplifiable region, 

20 whereby said target gene is expressed and said protein 
is produced. 

9, A method according to Claim 8, wherein 
said amplifying is with said secondary expression host 
cells . 

25 10. A method according to Claim 8, wherein 

said primary cells are human cells . 

11. A method according to Claim 10, wherein 
said human cells are diploid fibroblast cells . 

12. A method according to Claim 8, wherein 
30 said amplifiable gene is a mutated DHFR gene having a 

higher Km than the wild-type gene. 

13. A method according to Claim 12, wherein 
said secondary host expression cell is DHFR d»flc\ent. 

14. A method according to Claim 8, wherein 
35 said construct further comprises a marker gene 

separated from said amplifiable region by an 
homologous flanking region. 
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if) K bum.?.! Ct;Ll oonfp.r3.sing an amplit iable 
ger«e nther than its Id -type site in the hurra n 
genome and within the locus of a target gene 
expressing a protein to provide amplification of said 
5 target gene. 

16. A human cell according to Claim 14, 
wherein said cell is a normal cell. 

17. A human cell according to Claim 14, 
wherein said cell is a neoplastic cell. 

10 18. A human cell according to Claim 14, 

wherein said amplifiable gene is a DHFR gene. 

19 . A mammalian cell other than a human 
cell for expression of mammalian proteins in culture 
comprising an amplifiable region comprising an 

15 amplifiable gene within lOkb of a human wild-type 

gene expressing a protein, wherein said two genes are 
separated by substantially solely human wild- type 
sequence associated with said target gene and the 
flanking sequence associated with the amplifiable 

2 0 gene . 

20. A method for producing, cells for 
expression of a heterologous protein in culture, said 
method comprising: 

transforming mammalian primary cells 
25 comprising said target gene with a construct 

comprising an amplifiable gene and at least one 
flanking region of at least about 150bp homologous 
with a DNA sequence within lOlcb of the coding region 
of said target gene, wherein said amplifiable gene is 
30 at a site which does not interfere with the expression 
of said target gene, whereby said construct becomes 
homologous ly integrated into the genome of said 
primary cells to define an amplifiable region 
comprising said amplifiable gene and said target gene 
35 in said genome; 

selecting for primairy^ cells comprising sai4 
construct by means of said amplifiable gen or other 
marker present in said construct; 



.jiClai-lAg DNA port i.oyij oi: a aid genome frc ffi 
said prijnary cells, wherein said portions are large 
enough to include all of said amplifiable region; 

transforming mammalian secondary expression 
host cells with said primary cell DNA portions, 
wherein said secondary expression host cells are of a 
different species from said primary host cells, and 
cloning said transformed secondary expression host 
cells to produce clones of said secondary expression 
host cells differing in said DNA portions present in 
said secondary expression host cells; 

selecting clones of said mammalian secondary 
expression host cells comprising said amplifiable 
region; and amplifying said amplifiable region by 
means of an amplifying agent, wherein said amplifying 
is either prior to said isolating or after said 
selecting. 

21. A method according to Claim 20, wherein 
said amplifying is with said secondary expression host 
cells . 

22. A method according to Claim 20, wherein 
said primary cells are hxmian cells . 

23. A method according to Claim 22, wherein 
said human cells are diploid fibroblast cells. 

24. A method according to Claim 20, wherein 
said amplifiable gene is a mutated DHFR gene having a 
higher Km than the wild-type gene. 

25. A method according to Claim 24, wherein 
said secondary host expression cell is DHFR deficient. 
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